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-Abstract- 

A new technique is presented to prove non-termination of term rewriting. The basic idea is to 
find a non-empty regular language of terms that is closed under rewriting and does not contain 
normal forms. It is automated by representing the language by a tree automaton with a fixed 
number of states, and expressing the mentioned requirements in a SAT formula. Satisfiability 
of this formula implies non-termination. Our approach succeeds for many examples where all 
earlier techniques fail, for instance for the S-rule from combinatory logic. 
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1 Introduction 

A basic approach for proving that a term rewriting system (TRS) is non-terminating is to 
prove that it admits a loop , that is, a reduction of the shape t —>+ C[to r], see [27]. Indeed, 
such a loop gives rise to an infinite reduction t — C[ta\ — C[(C[ta])a] —>■••• in which in 
every step t is replaced by C[ta\. In trying to prove non-termination, several tools ([HE]) 
search for a loop. An extension from [7], implemented in [T] goes a step further: it searches for 
reductions of the shape ta n /i —> + C[fcr^ n V T ] for every n for a linear increasing function /, 
and some extensions. All of these patterns are chosen to be extended to an infinite reduction 
in an obvious way, hence proving non-termination. However, many non-terminating TRSs 
exist not admitting an infinite reduction of this regular shape, or the technique from [7] fails 
to find it. 

A crucial example is the S'-rule a(a(a(S, x), y), z) —> a(a(x , z),a(y , z)), one of the building 
blocks of Combinatory Logic. Although being only one single rule, and having nice properties 
like orthogonality, non-termination of this system is a hard issue. Infinite reductions are 
known, but are of a complicated shape, see [32]. So developing a general technique that can 
prove non-termination of the S-rule automatically is a great challenge. In this paper we 
succeed in presenting such a technique, and we describe a SAT-based approach by which 
non-termination of many TRSs, including the S-rule, is proved fully automatically. 


* This is an extended version of the paper |23| published at RTA 2015. This extension includes trans¬ 
formation for strengthening the presented non-termination techniques (see Remarks [8] and [9j, a detailed 
description of the SAT encoding of the improved technique in Section [sj (s ee RemarF |39[ and a remark 
on the completeness of the method with respect to loops (see Remar k|40l. 
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The underlying idea is quite simple: non-termination immediately follows from the 
existence of a non-empty set of terms that is closed under rewriting and does not contain 
normal forms. Our approach is to find such a set being the language accepted by a finite tree 
automaton, and find this tree automaton from the satisfying assignment of a SAT formula 
describing the above requirements. Hence the goal is to describe the requirements, namely 
non-emptiness, closed under rewriting, and not containing normal forms, in a SAT formula. 

We want to stress that having quick methods for proving non-termination of term rewriting 
also may be fruitful for proving termination. In a typical search for a termination proof, like 
using the dependency pair framework, the original problem is transformed in several ways 
to other termination problems that not all need to be terminating. Being able to quickly 
recognize non-termination of some of them makes a further search for termination proofs 
redundant, which may speed up the overall search for a termination proof. 

We note that, like termination, non-termination is an undecidable property. However, 
while termination is n^-complete, non-termination is S^-complete PH HD]. 

The paper is organized as follows. In Section [2] we present our basic approach in the 
setting of abstract reduction systems on a set T, in which the language is just a subset of T. 
Surprisingly, being not weakly normalizing corresponds to (strongly) closed under rewriting, 
and being not strongly normalizing corresponds to weakly closed under rewriting. In Section 
[3] we give preliminaries on tree automata and show how string automata can be seen as an 
instance of tree automata. In Section [4] we present our basic methods, starting by how the 
requirements are expressed in SAT, and next how this is used to disprove weak normalization 
and strong normalization. In Section [5] we strengthen our approach by labeling the states of 
the tree automata by sets of rewrite rules and exploiting this in the method. In Section [6] we 
present experimental results of our implementation. We conclude in Section [7] 

Related Work 

The paper m introduces the notion of loops and investigates necessary conditions for the 
existence of them. The work ;35] employs SAT solvers to find loops, [361 uses forward closures 
to find loops efficiently, and [37] introduces ‘compressed loops’ to find certain forms of very 
long loops. Non-termination beyond loops has been investigated in [30] and [7]. There the 
basic idea is the search for a particular generalization of loops, like a term t and substitutions 
a, t such that for every n there exist C, /i such that ta n r rewrites to C[ta^ n ^Tfjb], for some 
ascending linear function /. Although the 5-rule admits such reductions, these techniques 
fail to find them. For other examples for which not even reductions exist of the shape studied 
in m and [7], we will be able to prove non-termination fully automatically. 

Our approach can be summarized as searching for non-termination proofs based on 
regular (tree) automata. Regular (tree) automata have been fruitfully applied to a wide 
rage of properties of term rewriting systems: for proving termination infinitary 

normalization JT2] , liveness [25] , and for analyzing reachability and deciding the existence 
of common reducts [Ml US- Local termination on regular languages, has been investigated 
in 0. 

2 Abstract Rewriting 

An abstract reduction system (ARS) is a binary relation —> on a set T. We write —for the 
transitive closure, and —for the reflexive, transitive closure of —K 

Let —> be an ARS on T. The ARS —> is called terminating or strongly normalizing (SN) 
if no infinite sequence to,ti,t 2 , ■ ■ ■ £ T exists such that t t —> U+ 1 for all i > 0. A normal 
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form with respect to —¥ is an element t £ T such that no u £ T exists satisfying t —» u. The 
set of all normal forms with respect to —► is denoted by NF(—►). The ARS —► is called weakly 
normalizing (WN) if for every t £ T a normal form u £ T exists such that t —>* u. 

► Definition 1. A set L CT is called 

h closed under —¥ if for all t £ L and all u £ T satisfying t —> u it holds u £ L, and 
h weakly closed under —> if for all t £ L \ NF(—►) there exists u £ L such that t —¥ u. 

It is straightforward from these definitions that if L is closed under —>, then L is weakly 
closed under —► as well. The following theorems relate these notions to SN and WN. 

► Theorem 2. An ARS —► on T is not SN if and only if a non-empty L C T exists such 
that L (~l NF (—>) = 0 and L is weakly closed under —> + . 

Proof. If —> is not SN then an infinite sequence to, ti,t 2 , ■ ■ ■ £ T exists such that ti —>• t l+ \ 
for all i > 0. Then L = {ti \ i > 0} satisfies the required properties. 

Conversely, assume L satisfies the given properties. Since L is non-empty we can choose 
to € L, and using the other properties for i = 0,1,2,3,... we can choose ti+ 1 £ L such that 
ti — t i+ 1 , proving that — > is not SN. ◄ 

► Theorem 3. An ARS on T is not WN if and only if a non-empty L C T exists such 
that L (~l NF(—►) = 0 and L is closed under —>. 

Proof. If —> is not WN then t £ T exists such that Lr 1 NF(—>) = 0 for L = {u £ T \ t —u}. 
Then L satisfies the required properties. 

Conversely, assume L satisfies the given properties. Since L is non-empty we can choose 
to £ L. Assume that —> is WN, then to ^~► 1 ■ ■ —^► t n exists such that t n £ NF (—>). Since 
L is closed under —»• we obtain ti £ L for i = 1,2,, n, contradicting L n NF (—>) =0. ◄ 

A variant of Theorem [2j where —> + is replaced by —has been observed in [3]. To the 
best knowledge of the authors, Theorem [3] has not been observed in the literature. 

3 Tree Automata 

► Definition 4. A (non-deterministic finite) tree automaton A over a signature E is a tuple 
A = ( Q,Yi,F,5) where 

(i) Q is a finite set of states, 

(ii) F C Q is a set of accepting states, and 

(Hi) S a set of rewrite rules, called transition rules, of the shape 

fill, Q 

where n is the arity of / £ E and qi,... ,q n ,q £ Q. We write for the rewrite relation 
generated by the rules <5. 

Note that we use to distinguish automata transitions from term rewriting —> with respect 
to some TRS R. 

► Definition 5. The language C(A) accepted by A is the set 


C(A) = {t | t £ T( E, 0), q £ F, t -A q } 
of ground terms that rewrite to a final state. 


4 


Proving Looping and Non-Looping Non-Termination by Finite Automata 


The kind of tree automata considered here is called bottom up in the literature. Sometimes 
in the definition of bottom-up tree automaton the right hand side q in the rule has arguments 
and the acceptance criterion is rewriting to a term with a final state as root. However, when 
tree automata are only used for defining (term) languages as is the case in this paper, these 
definitions coincide. 

Tree automata can be seen as a generalization of string automata as follows. For a string 
automaton (= NFA) S define the tree automaton A by 

h taking the same sets of states and accepting states, and 

h taking as signature the same signature in which all symbols are unary, extended by a 
single constant e, and 

h taking as transition rules e -w qo for q 0 being the initial state of S , and for every transition 
q A q' in S the rule a(q) q'. 


Form this definition it is immediate that a string a\a 2 ■ ■ ■ a n is accepted by S if and only if 
a n (a n - 1 (- • • (ai(e)) • • •)) is accepted by A. Here we assume that S reads the string from left 
to right (otherwise there is no need to reverse the order of the letters). 


► Example 6. To define a tree automaton accepting the language b a* ( L\R ) a* b , that is, 
all words that start with 6, end with b , contain one L or R and otherwise only a, we start by 
its corresponding string automaton 



The above construction yields the tree 
automaton Arr = (Q,H,F, S) where 
E = {6, L, R, a,e} in which 6, L, R, a 
are unary and £ is a constant, Q = 
{0,1,2,3}, F = {3} and S consists of 
the rules 


£ 0 a(l) 1 6(0) 1 

a(2) 2 6(2) -w 3 


R( 1) 2 L( 1) 2 


► Example 7. The following is a tree automaton for the signature E = {a, 5} where a is 
binary and 5 is a constant. Let As = ( Q, E, F, S) where Q = {0,1, 2,3,4}, F = {4} and 


a(0,0)~*l a(l,0)~»2 a(2,2) 3 a(3,3)~»3 

a(0,2) 2 a(2,3) 3 a(3,3)~»4 

a(0, 3) 2 


As is usual in combinatory logic, ground terms are represented by omitting the a symbol and 
writing uvw = ( uv)w. We show that this automaton accepts the term SSS(SSS)(SSS(SSS)): 

SSS{SSS){SSS{SSS)) ^ 12 000(000)(000(000)) 

10(10)(10(10)) 22(22) 33 -* 1 4 


Since 4 £ F the term is accepted by the automaton. 

This automaton has been found automatically by our tool, and its language is closely 
related to the QQQ-criterion of Waldmann |31 E] Roughly speaking, the language recognized 
by this automaton can be described as follows: 

h state 0 accepts only the term S, 

m state 1 accepts only the term SS, 

m state 2 corresponds to terms that contain at least one occurrence of SSS, 

m state 3 corresponds to terms that contain at least two occurrence of SSS, and 

h state 4 accepts terms MN for which both M and N contain two occurrences of SSS. 
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4 Basic Methods 

In this section, we are concerned with automating the abstract non-termination methods 
from Section [2] To this end, we use finite tree automata giving rise to regular tree languages. 
We first develop methods for disproving weak normalization and then for disproving strong 
normalization. 

The applicability of the non-termination techniques described in the remainder of this 
paper can be improved by two simple transformations of term rewrite systems R. The first 
transformation concerns the introduction of a fresh constant in E (see Remark [ 8 ]) and the 
second transformation describes the elimination of collapsing rules in R (Remark [9]) . Both 
transformation do not affect (weak) normalisation of R. 

► Remark 8 (Adding fresh constants). Let R be a TRS over a signature E. If we are interested 
in non-termination of R on arbitrary terms (including non-ground terms), then, without loss 
of generality, we can extend the signature E with a fresh constant c (a symbol of arity 0). 
Since the fresh constant can be thought of as a variable, this transformation neither affects 
weak nor strong normalisation of R. Moreover, by adding a constant, we can reduce strong 
normalisation of R to strong normalisation of R on all ground terms. The reason is that, if 
the signature contains at least one constant symbol, then both properties coincide. 

Note that adding a single fresh constant does not not suffice for weak normalisation. 
The addition of multiple fresh constants might be needed to make weak normalisation of R 
coincide with weak normalisation of R on ground terms. 

In this paper, we are interested in disproving (weakly) normalisation. The automata 
techniques we employ in this paper actually yield the the stronger property that R is not 
(weakly) normalising on ground terms. Therefore we tacitly assume that the signature is 
extended with a fresh constant (if it does not already contain one). 

► Remark 9 (Elimination of collapsing rules). Let I? be a TRS over a signature E. For the 

techniques in the remainder by first eliminating collapsing rules, that is, rules of the form 
£ —> x with x £ X. Assume that the TRS R contains a collapsing rule £ —> x. For every 
/ £ E we define the substitution 07 : X —>• T(E, X) by = /(aq,..., £#(/)) for fresh 

variables aq,..., £#(/) and <Jf(y) = y for all y ^ x. We define 

R' = (R \ {£ —> x}) U {£<jf —> xaf | / € E} 

Then R and R' induce the same rewrite relation on ground terms. Hence R' is (weakly) 
ground normalizing if and only if R is. 

Moreover, R' is strongly normalising if and only if R is. This can be seen as follows. The 
‘if’-direction follows immediately from — >r< C — y R . For the ‘only if’-direction assume that R 
admits an infinite rewrite sequence t\ -^r £2 —>r £3 —>r ■ ■ •. Let / G E and let aq,..., x#(/) 
be fresh variables. Define a substitution a by a(x) = /(aq,..., x#(f)) for all x £ X. Then 
H 0 " —>R> t 2 <y —>r> t^a —>ri • • • is an infinite rewrite sequence in R!. 

4.1 SAT Encoding of Properties 

In this section, we collect decision procedures for the main properties of tree automata that 
we employ for proving non-termination, and we describe how we encode these procedures as 
Boolean satisfiability problems (SAT). 

► Remark 10 (SAT encoding of tree automata). We encode the search for a tree automaton 
A = ( Q, E, F, S) over a signature E as a satisfiability problem as follows. We pick the number 
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of states n £ N the automaton should have; the set of states is Q = {si,..., s„}. While the 
set of states Q is fix, we represent the final states F C Q by n fresh Boolean variables 

VF,si j v F,s 2 , Vf,s 3 ,■■■, VF,s„ 

and, for every f £ S, we represent the transition relation 8 by fresh variables 

Vf, qi ,...,q# U)> q for every q u ..., q #{f) , q £ Q 

For the moment, there are no constraints (formulas) and the interpretation of these variables 
can be chosen freely. The intention is that VF, Si is true if and only if s* is a final state, and 
v f,qi,---,q# ( f),q t rue if and only if f(qi ,..., ?#(/)) q is a transition rule in 8. 

► Definition 11. A state q £ Q of a tree automaton A = (Q,S,F,8) is called reachable if 
there exists a ground term t £ T(S, 0) such that t q. 

We assume, without loss of generality, that all states are reachable. Note that requiring 
that all states are reachable is not a restriction since we can always replace unreachable 
states by ‘copies’ of reachable states. We guarantee reachability as follows. 

► Lemma 12. Let A = (Q, £, F,S) be a tree automaton. Then all states of A are reachable 
if and only if there exists a total well-founded order < on the states Q such that for every 
q £ Q there exists f £ £ and states q\ < q, q 2 < q, . . . , q#tf) < q with f(qi ,..., <?#(/)) q. 

Proof. The reachable states are the smallest set Q' C Q that is closed under 8, that is, 
q £ Q' whenever f(qi ,..., q#(f)) ^ q for some / £ S and states q±,..., q#(f) £ Q'■ 

For the ‘if’-part, assume that there was a non-reachable state. Let q £ Q be the smallest 
non-reachable state with respect to the order <. By assumption there exist / £S and states 
qi < q, q 2 < q, ■■■, q#(f) < q with f(qi ,..., q#(f)) q■ By choice of q it follows that all 
states qi, g 2 ) • • ■ i <?#(/) are reachable, and hence q is reachable, contradicting the assumption. 

For the ‘only if’-part, assume that all states are reachable. Then Q is the result of stepwise 
closing 0 under 8. There exists a sequence of states 0 = Qo Q Qi Q ■ ■ ■ Q Q\q\ = Q such 
that for every 0 < i < |<2| we have Qi+\ = Qi U {qi} for some qi £ Q\Qi such that there are 
fi £ £ and states q ^,..., qi,#^) £ Qi with fi{q it i ,%,#(/)) qi- The order < induced 
by qo < qi < ■ • ■ < 9IQI-1 is a total order on the states with the desired property. ◄ 

► Remark 13 (SAT encoding of reachability of all states). We extend the encoding of tree 
automata as described in Remark [To] We want to guarantee that all states are reachable by 
employing Lemma |12| However, instead of encoding an arbitrary well-founded relation, we 
make use of the fact that the names of states are irrelevant. Hence, without loss of generality 
(modulo renaming of states), we may assume that Si < s 2 < ■ ■ ■ < s n . We then encode the 
condition of Lemma [12] by formulas 

V v f,qi,---,qn,q 

/GS, q!<q,..., q #M <q 

for every q £ Q. 

The following lemma is immediate. 

► Lemma 14. Let A= (Q,T, 7 F,8) be a tree automaton such that all states are reachable. 
Then C(A) ^ 0 if and only if F ^ 0. 
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► Remark 15 (SAT encoding of C(A) yf 0 ). In a setting where all states are reachable, the 
encoding of C(A) yf 0 as satisfiability problem trivializes to: V 9 eQ Vf ,q- 

The following lemma gives a simple criterion for closure under rewriting. 


► Lemma 16 (Genet [25] Proposition 12]). Let A = (Q,H, F, S) be a tree automaton and R 
a left-linear term rewriting system. Then C{A) is closed under rewriting with respect to R if 
for every £ -^ r £ R, a : X Q and q £ Q we have to q => ra q. 

Note that left-linearity of R is crucial for the Lemma [l 6 | since A can be a non-deterministic 
automaton. If R would contain non-left-linear rules £ — > r then we would need to check 
set-assignments a : X —> p(Q) instead a : X —> Q. That is, we would need to take into 
account, that a non-deterministic automaton can interpret the same term by different states. 

For terms t, we use Var(t) to denote the set of variables occurring in t. 


► Remark 17 (SAT encoding of closure under rewriting). We encode the conditions of Lemma[16| 


Let the automaton A be encoded as in Remark m Let U be the set of all non-variable 
subterms of left-hand sides and right-hand sides of rules in R. For every t £ U, assignment 
a : Var(t) —> Q and q £ Q we introduce a fresh variable 


Vt : a,q with the intended meaning: Vt >a ,q is true •<=>• to q . 

We ensure this meaning by the following formulas: for terms t = f(ti ,..., t n ) £ U 

Vt,ot.,q ^ ^ V K l,CKi,gi A ... A Vt„,a„,q n A v f,qi,...,q n ,q) 

qi,...,q n £Q 

where on is the restriction of a to the domain Var(ti). For variables x £ U, we stipulate 
Vx,a,q a(x) = q\ note that we can immediately evaluate and fill in these truth values. 

Finally, we encode to q => ra q by formulas 


^ ^r,a.q 

for every i -» r £ R, a : Var(£) —> Q and q £ Q. 

The following modification of Lemma [16] gives a simple criterion for weak closure under 
rewriting. The requirement ra q of Lemma 16 is weakened to to q for some reduct 
t the left-hand side t. 


► Lemma 18. Let A = (Q, S, F, S) be a tree automaton and R a left-linear term rewriting 
system. Then C{A) is weakly closed under rewriting with respect to R if for every i r £ R, 
a : X —► Q and q £ Q we have to q => to q for some term t such that l —t. 

Proof. Let s £ C(A) \ NF(—>#). Then s = C\lo\ for a context C, rewrite rule i —► r € R 
and substitution a : X —> T(£,0). Since s £ C(A) there exists q £ Q such that to q 
and C[q] q' with q' £ F. By left-linearity l does not contain duplicated occurrences of 
variables. As a consequence, there exists a : X —> Q such that cr(x) a(x) and to q. 
By the assumptions of the lemma, a term t exists such that t and to q. Hence 

to q and C[to\ C[q] q'. Thus C[to\ £ C(A). Since s = C\lo\ — C[to\, this 
proves that C(A) is weakly closed under rewriting with respect to R. ◄ 


► Remark 19 (SAT encoding of Lemma 

similar to Lemma 


16 


The conditions of Lemma [18| can be encoded 
(described in Remark . In Lemma [ 


18 


the condition ra 


q is 


weakened to: to q for some reduct t the left-hand side i. We can pick a finite set of 
reducts U C {t \ £ — t} of the left-hand side £, and encode the disjunction \J t&u ta q 
as a Boolean satisfiability problem. Note that U yf 0 since r £U. 
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Next, we want to guarantee that the language C(A ) contains no normal forms, in other 
words, that every term in the language contains a redex. For left-linear term rewriting 
systems R, we can reduce this problem to language inclusion C(A) C C(B) where B is a 
tree automaton that accepts the language of reducible terms. If R is a left-linear rewrite 
system, then the set of ground terms containing redex occurrences is a regular tree language. 
A deterministic automaton B for this language can be constructed using the overlap-closure 
of subterms of left-hand sides, see further pans]. Here, we do not repeat the construction, 
but state the lemma that we will employ: 

► Lemma 20. Let {f \,... , £„} be a set of linear terms over E. Then we can construct a 
deterministic and complete automaton B = (Q, E, F, 6) and sets ,..., Fg n C Q such that 
for every term t £ T(E, 0 ) and i £ {1,..., n} we have: 

h t q with q £ Fg. if and only if t is an instance of 

Note that by choosing F = F^ U ... U Fg n we obtain: t q with q £ F if and only if t is 

an instance ii for some is{l,...,n}. 

► Example 21. The following tree automaton Bs = ( Q,T,,F,S ) accepts the language of 
ground terms that contain a redex occurrence with respect to the S'-rule a(a(a(S, x),y), z) —» 
a(a(x, z),a(y, z)). Here Q = {0,1, 2, 3}, E = {a, 5}, F = {3} and 

S 0 a( 0 ,< 7 ) 1 a(l,q)-^>2 a(2,g)~>3 a(3,q)~>3 a(q', 3) 3 

for all q £ { 0 , 1 , 2 } and q' £ { 0 , 1 , 2 ,3}. 

Since the automaton Bs is deterministic and complete, we can obtain an automaton 
Bs = (<2,E,F,£) that accepts the complement of the language (the language of ground 
normal forms) by taking the complement F = {0,1, 2} of the set of final states. 

The following is crucial for feasibility of our approach. Deciding language inclusion of non- 
deterministic automata is known to be EXPTIME complete, see m- However, to guarantee 
that a language contains no normal forms, it suffices to check whether two non-deterministic 
automata have a non-empty intersection. This property can be decided in polynomial time 
by constructing the product automaton and considering the reachable states. 

► Definition 22. The product AxB of tree automata A = (Q, E, F, 6} and B = ( Q ', E, F ', S ') 
is the tree automaton C = ( Q x Q ', E, F x F', 7 ) where the transition relation 7 is given by 

/( (qi,Pl),---,(q n ,Pn) ) f(qi> <i A f(pi,...,Pn) ~^5' 

for every / £ E of arity n and states q\,, q n . q' £ Q and p±,... ,p n ,p' G Q' . 

► Lemma 23. Let A = (Q,S,F,S) and B = ( Q\ E, F', S') be tree automata. Then we have 
C(A) ft C{B) = 0 if and only if in Ax B no state in F x F' is reachable. 

Proof. Let A x B = (Q x Q', E, 0 , 7 ). For the ‘if’-part, assume that C{A) n C(B) ^ 0 . Let 
t £ C(A) n C(B). Then t q for some q £ F and t q' for some q' £ F'. But then 
t (q, q') and hence (q, q') £ F x F' is reachable in AxB ; this contradicts the assumption. 

For the ‘only if’-part, assume, for a contradiction, that t (q, q') inixB with q £ F 
and q' £ F'. Then this directly translates to t -wj q in A and t q' in B. Hence t £ C(A) 
and t £ C(B), contradicting C(A) D C(B) = 0 . ◄ 

We can use Lemma [23] to check that the language C{A) of an automaton A does not 
contain normal forms. To this end, we only need an automaton B that accepts all ground 
normal forms. Then C(A) contains no normal forms if C(A) f! C(B) = 0 . 
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► Example 24. The reachable states of the product As x Bs of the automata As from 
Example [ 7 ] and Bs from Example 21 are (0,0), (1,1), (2, 2), (2,1), (3,3), (3,2), (2,3), (4, 3). 


The only state (q, q') such that q is accepting in As is (4,3) and 3 is not an accepting state 

are fulfilled and hence C(As) D C(B$) = 0. Recall that 


23 


of Bs- The conditions of Lemma 
Bs accepts all ground normal forms, and thus every term accepted by As contains a redex. 

► Remark 25 (SAT encoding of empty intersection). Let A = (Q, £, F, S) and B = 

( Q', £, F', S') be tree automata. Let A x B = (Q x Q',£, 0 , 7 ). 

First, note that reachability of all states in the automata A and B does not imply that 
all states in the product automaton A x B are reachable. As a consequence, we have to 
‘compute’ the set of reachable states using Boolean satisfiability problems. For this purpose, 


we reformulate Lemma 23 in the following equivalent way: ..., then £(A) n C(B) = 0 if 


and only if there exists a set of states P C Q x Q' such that 

(i) P is closed under transitions in A x B , that is, q £ P whenever f(q ±,..., q n ) ~-> 7 q for 
some <?i,..., q n £ P, and 

(ii) for all (q, q') £ P it holds that q £ F implies q' ^ F'. 


Note that this statement is equivalent to Lemma 23 Item (i) guarantees that P contains 


all reachable states, and hence (ii) is required for at least the reachable states. Thus the 
conditions imply those of Lemma |23| On the other hand, we can take P to be precisely the 
set of reachable states, and then the conditions are exactly those of Lemma [23] 

The idea is that the reformulated statement has a much more efficient encoding as Boolean 
satisfiability problem. We only need to encode the closure of P under transitions, but there 
is no longer the need for encoding the property that P is the smallest such set (which is a 
statement of second-order logic). 

Assume that we have a SAT encoding of the automata A and B as in Remark |10| we 
write va,... for the variables encoding A , and Vb,... for the variables encoding B. To represent 
the set P, we introduce variables P( q , q >) for every (q,q') £ Q x Q' and the properties are 
translated into the following formulas: 

(i) for every f £ T, with arity n and (<?i, q [),... , (q n , q' n ), (q, q') £ Q x Q': 

( V A,f,qi,...,q n ,q A V B,f,q' 1 ,...,q' n ,q' A P( 9 ll g') A P(q 2 ,q' 2 ) A ... A P(q n ,q>j) P(q,q') 

(ii) for every {q,q') £ Q x Q'\ (p( q , q >) A v A ,F, q ) ~<VB,F',q'- 


Each of these formulas simplifies to a single clause (a disjunction of literals). 

We remark that we will employ this translation for the case that B consists of the set 
of terms containing redex occurrences with respect to a given rewrite system R. Then B 
is known and fixed before the translation to a satisfiability problem. As a consequence, we 
know the truth values of Vbj, q' lt ...,q' ,q' an d VB,F>,q' in the formulas above, and can immediate 
skip the generation of formulas that are trivially true (the large majority in case (i)). 

► Remark 26 (Complexity of the SAT encoding). While the encoding is efficient for string 
rewriting systems, it suffers from an ‘encoding explosion’ for term rewriting systems con¬ 
taining symbols of higher arity. The problem arises from the SAT encoding of the recursive 
computation of the interpretation of terms (described in Remark |17|). The computation of the 
interpretation of a term f(t±,... ,f„) containing m variables needs 0(\Q\ m+n+1 ) clauses: m 
for the quantification over the variable assignments, n for the possible states of t \,... ,t n and 
1 for the possible result states. To some extend, this problem can be overcome by ‘uncurrying’ 






10 


Proving Looping and Non-Looping Non-Termination by Finite Automata 


the system, that is, for every symbol / of arity n > 2 we introduce fresh symbols /i,..., /„_i 
of arity 2 and then replace all occurrences of f(ti ,..., t n ) by /„_i(... / 2 (/i(ti, £ 2 ), £ 3 ) ■ ■ ■, t n ). 
This transformation helps to bring the complexity down to 0(|Q| m+3 ). Nevertheless, for 
example for the S-rule, which only contains binary symbols, we still need |Q | 6 clauses. We 
note that after the uncurrying transformation, an automaton with more states may be needed 
to generate ‘the same’ language. 


4.2 Disproving Weak Normalization 

We are now ready to use Theorem [3] in combination with tree automata for automatically 
disproving weak normalization. The language L in the theorem is described by a non- 
deterministic tree automaton. In the previous section, we have seen how the relevant 
properties of tree automata can be checked. Here, we summarize the procedure: 


► Technique 27. Let R be a left-linear TRS. We search for a tree automaton A = (Q^jF^S) 
such that C(A) fulfills the properties of Theorem [3j 

( i ) We guarantee C(A) fl NF(— >•) = 0 by the following steps: 


_ We employ Lemma 20 to construct a deterministic, complete automaton B = ( Q, £, F, 5) 
that accepts the set of terms containing redex occurrences with respect to R. 
m Then the automaton B = ( Q,Q\T,,F,S) accepts all ground normal forms. 

- We use Lemma 23 to check that C(A) n C(B ) = 0 (thus C(A) C C(B)). 

(ii) We guarantee that C(A) is closed under —>• by Lemma [l 6 | 


(in) We use Lemma 14 to ensure that £(A) 7 ^ 0 . 


These conditions can be encoded as satisfiability problems as described in Remarks 10 13 
25] 1_5 and [TT] This enables us to utilize SAT solvers to search for suitable automata A. 


► Remark 28. We note the combination of Technique [27] with Remark [9] is complete with 
respect to disproving weak normalization on regular languages: if there exists a regular 
language L fulfilling the conditions of Theorem [3] then weak normalization can be disproved 
using Technique [27] after eliminating collapsing rules as in Remark [9] 

This can be seen as follows. In the work a generalized method for ensuring closure 
of the language of automata under rewriting has been proposed. Thereby the condition 
£a q => ra q of Lemma 16 is weakened to 


la q 


ra p for some p > q . 


(1) 


Here < is a quasi-order on the states Q and the automaton must be monotonic with respect 
to this order, see Definition [35] The monotonicity guarantees that the language of the 
automaton is closed under rewriting. 

In fM\ it has been shown that this monotonicity property is strong enough to characterize 
and decide the closure of the regular languages under rewriting. In particular, the language 
of a deterministic tree automaton is closed under rewriting if and only if there exists such a 
monotonic quasi-order on the states. 

Let R be a TRS such that there exists a regular language that satisfies the conditions of 
Theorem [3] Then there exists a deterministic, complete automaton A accepting this language 
and a quasi-order < on the states satisfying (JT|) and monotonicity. Let R! be obtained from 
R by eliminating collapsing rules as described in Remark [9] We obtain a non-deterministic 
automaton A' that fulfils the requirements of Technique [27] for R' by closing the transition 
relation of A under <: we add f(qi ,..., q n ) q whenever q < p and f(qi, ..., q n ) p. As 
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a consequence of monotonicity and using induction over the term structure, we obtain for all 
terms t £ T( E, X) with t qL X and a : X —y Q that 


(*) t * A f q if and only if t p for some p with q < p. 

As a consequence of (*) and monotonicity we have C(A') = C(A) (roughly speaking, if q < p, 
then q accepts a subsets of the language of p). Thus C(A') D NF(—►) = 0 and C(A') ^ 0 are 
guaranteed. Finally, we show that Lemma [l6| is applicable for R! and A'. Let £ — y r £ R!, 
a : X —> Q and q £ Q such that £a ~^* A , q. Then by (*) we get £a q' for some q' £ Q 
with q < q'. By (|T|) we have that ra q" for some q" £ Q with q' < q". Again by (*) we 
obtain that ra q. Hence the conditions of Technique 27 are fulfilled for R' and A'. 


► Example 29. We consider the following string rewriting system: 


aL —y La 


Ra —^ aR bL —y bR Rb —y Lab 


This rewrite system is neither strongly nor weakly normalizing, but does not admit looping 
reductions, that is, reductions of the form s — tsr. An example of an infinite reduction is: 

bLb —y bRb —y bLab —» bRab —y baRb —y baLab —y bLaab —y bRaab 


It is easy to check that the automaton Alr from Example [6] fulfills the requirements of 
Technique |27[ Hence, the system is not weakly normalizing. 

► Example 30. We consider the S'-rule from combinatory logic: 


a(a(a(S, x), y),z) ->• a{a{x, z),a{y, z)) 

For the A-rule it is known that there are no reductions t —y* C[t] for ground terms t, see [521 . 
For open terms t the existence of reductions t — C[ta\ is open. 

It is straightforward to verify that the automaton As from Example [7] fulfills the require¬ 
ments of Technique [27| and hence the 5-rule, and in particular the term SSS(SSS)(SSS{SSS)), 
are not weakly normalizing. 

► Example 31. The <5-rule (known as Owl in Combinatory Logic) is even simpler: 


5xy —y y(xy), or equivalently a(a(5,x),y) —y a(y,a(x,y)) . 


As shown in [4], this rule does not admit loops, , and the techniques in (7| fail for this system. 
The Technique |27| can be applied to automatically disprove weak-normalization for this rule. 
Our tool finds a tree automaton that has 3 states and accepts the language of all ground 
terms with two occurrences of SS. In fact, this is precisely the language of non-terminating 
ground 5-terms, see further [4]. 


In all examples until now infinite reductions exist of the regular shape based on ta n r 
rewriting to a term having ta^ n ^Ty as a sub-term, for every n, for some term t and 
substitutions a, r, y and an ascending linear function /. For instance, the 5 rule (Example 


30) admits an infinite reduction implied by ta n r rewriting to a super-term of ta n+1 r, for 


t = a(x,x), <j{x) = Ax, t(x) = 5A(5AA), for A = SSS. 


► Example 32. The following example does not have an infinite reduction of this regular 
shape, neither of the more general patterns from [50] and 0. 


aL —y La 


Raa —y aaaR 


bL —y bRa 


Rb —y Lb 


Rab —y Lab. 
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In this system bRa n b rewrites to bRa^^b for / defined by f{2n) = 3n + 1 and /(2n + 1) = 
3n + 2 for all n. This obviously yields an infinite reduction, but / is not linear, by which this 
example is outside the scope of [3D] and [?j. In our approach a proof of non-termination and 
even non-weak-normalization is extremely simple: ba*{L \ R)a*b is non-empty, closed under 
rewriting and does not contain normal forms. 


4.3 Disproving Strong Normalization 

For disproving strong normalization based on Theorem[2] the only difference with Technique [27] 
is that checking that L is closed under —> by Lemma [16] has to be replaced by checking that 
L is weakly closed under —> by Lemma |18[ The technique is applicable to string and term 
rewriting systems, and can be automated as described in Technique |27| and Remark |19| 

► Example 33. Let us consider the rewrite system 

aaL Laa Ra —> aR bL —> bR Rb —> Lab Rb —> aLb 

This system is non-looping and non-terminating. However, in contrast to Example |29| this 
system is weakly normalizing, since by always choosing the fourth rule the last rule is never 
used, and the first four rules are terminating. Hence the Technique |27| is not applicable for 
this TRS. However, the following pattern extends to an infinite reduction 

bRa 2n b 2n ba 2n Rb ba 2n Lab bLa 2n+1 b -> 


bRa 2n+1 b -> 2n+1 ba 2n+1 Rb -> ba 2n+2 Lb -> n+1 bLa 2n+2 b -> bRa 2n+2 b. 


Instead of finding this pattern explicitly, 
non-termination is also concluded from 
checking that b(aa)*(L \ R \ aR)a*b 
describes a language satisfying all condi¬ 
tions from Theorem [2] A corresponding 
automaton is given on the right. 

a 

The conditions of Theorem [2] are now checked as follows. Non-emptiness follows from the 
existence of a path from state 0 to state 4. Every path from 0 to 4 either contains one of 
the patterns aaL, Ra, bL or Rb, so it remains to show weakly closedness under rewriting by 
Lemma |T8] In the setting of string automata this means that for every left hand side l and 
every £-path from a state p to a state q we should find a zt-path from p to q for a string u 
such that t rewrites to u in one or more steps. For i = aaL the only path is from 1 to 2, for 
which there is also an Laa path. For l = Ra there is a path from 1 to 2, for which there is 
also an aR path via 3. The only other option for £ = Ra is a path from 3 to 2, for which 
there is also an aR path via 1. For i = bL the only path is from 0 to 2, for which there is 
also a bR path. Finally, for i = Rb there is a path from 1 to 4, for which there is also an Lab 
path and a path from 3 to 4, for which there is also an aLb path, by which all conditions 
have been verified. Note that for the Rb- path from 1 to 4 it is essential to use the 4th rule, 
while for the Rb- path from 3 to 4 it is essential to use the last rule. 



This example can also be treated by the technique introduced in the following section. 
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■ Improved Methods for Disproving Strong Normalization 

In this section, we improve the method for proving non-termination. The methods introduced 
so far are not able to handle the following example. 

► Example 34. We consider the following string rewriting system: 

zL —> Lz Rz zR zLL —>• zLR RRz —> LzRz 

This rewrite system is weakly normalizing but not strongly normalizing. The non-termination 
criteria introduced in the previous sections are not applicable for this system. Let us consider 
the first steps of an infinite reduction: 

zLLzzRz 

—> zLRzzRz —> zLzRzRz —> zLzzRRz 
—> zLzzLzRz —► zLzLzzRz —> zLLzzzRz 


Note the underlined occurrences of zLL. Due to the rule zLL —> zLR , the word zL is the 
marker for ‘turning’ on the left; However, this marker zL is itself a redex. To obtain an 
infinite reduction, this marker must not be reduced. 

The idea for proving non-termination of systems like Example [34] is to let the automaton 
determine which redex to contract. To this end, we introduce a ‘redex selection’ function 

£ : Q V(R) 


that maps states of the automaton to sets of rules that may be contracted at the corresponding 
position in the term. The idea is that a redex £a in a term C[£a\ with respect to a rule 
£ — > r is allowed to be contracted if £a q with £ —> r £ £(<?)■ In this way, the automaton 
determines what redexes are to be contracted. Then the automaton only needs to fulfill the 
property £a q => ra q for the selected rules: 

£ —» r £ £(q) A £a -^>* A q => ra q 


for every rule £ —>• r £ R, state q £ Q and a : X — >■ Q. Moreover, as proposed in [3 ED HU , we 
weaken the requirement ra q to ra p for some p > q. Here < is a quasi-order on the 
states and the automaton must be monotonic with respect to this order (see Definition 35 ). 
The monotonicity guarantees that the language of the automaton is closed under rewriting. 
For the present paper, this closure property holds only for the rules selected by £. 


► Definition 35 (Monotonicity). A tree automaton A = (Q,H,F,d) is monotonic with 
respect to a quasi-order < on the states Q if the following properties hold: 


(i) For all / € S with arity n and states ai < &i, ci2 < &2, ■ • • > a n < b n , it holds 


/(ai,..., a n ) q 


f(bi ,..., b n ) P for some p € Q with q < p 


(ii) Whenever q G F and q < p, then p G F. 

The following lemma is immediate by induction on the size of the context. 

► Lemma 36. Let A = (Q,S, F, 5) be a tree automaton that is monotonic with respect to a 
quasi-order < on the states Q. Let a,b G Q with a < b. Then for all ground contexts C we 
have that C[a] a 1 with a' G Q implies that C[b\ b' for some b' £ Q with a' <b'. 
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► Definition 37 (Runs). Let A = (Q,Y,,F,6) be a tree automaton and t £ T(£, 0). A run 
of A on t is a function p : Pos(t) —► Q such that for every p £ Pos{t) and t(p) = f £ S there 
is a rule f(p(pl), ■ ■ ■, p(pn )) p(p) in <5. The run p is accepting if p(e) £ F. 

Note that there is a direct correspondence between runs on t and rewrite sequences t q. 
We are now ready to state the generalized theorem for disproving strong normalization. 

► Theorem 38. Let R be a left-linear TRS. Let A = (Q, £, F, 5) be a tree automaton with 
C(A) ^ 0, < a quasi-order on the states Q, and £ : Q —> P(R) a function, called redex 
selection function. Assume that the following properties hold: 


(a) The automaton A is monotonic with respect to <. 

(b) For every state q £ Q, rule £ -A r £ R and a : X —> Q it holds that: 

£a q A £ — > r £ £(q) => (3 p £ Q. q < p A ra p) V 

(3 r' < r. 3 q £ F. r'a q) 

(c) For every term t £ T(£, 0) and accepting run p on t there is a position p such that t\ p is 
an instance of the left-hand side of a rule t —> r £ f(p(p))- 

Then R is not strongly normalizing. 


Proof. Assume that the conditions of the theorem are fulfilled. To disprove strong normaliz¬ 
ation of —► it suffices to disprove strong normalization of —► o > where > is the (non-strict) 
sub-term relation. We show that C{A) and —> o > fulfill the requirements of Theorem [2] 
Let t £ T(A). Then there exists an accepting run p of A on t. By item |(c)| there exists a 
position p £ Pos(t ) and a rule <Ar£ f(p(p)) such that t\ p is an instance of £. Then t\ p = to 
for some substitution er. By left-linearity, we can define a : Var{t) —> Q by a(x) = p{pp') 
whenever £j p > £ X. Then ia p(p) and we distinguish cases according to item (b) 


1. There exists q £ Q with p{p) < q and ra q. We know that t[£o] p = t ~^* p(e) and 
define tf = t[ro] p . Note that t — > if and t -» o > t'. We have t t[p(p)\ p{e) and 
p(p) < q. By Lemma 36 we have t[q] q' for some q' > p(e) and by monotonicity q' £ F. 
Hence t' = t[ro] p t[q\ ^>* q'. Thus t' £ C{A) and t —> o > t'. 

2. There exist r' < r and q £ F such that r'a q. Then r'o q and hence r'o £ C(A). 
Moreover, t —> o > r'o. 


This shows that C{A) contains no normal forms and is weakly closed under —> o >. By 
Theorem [2] — > o > is not strongly normalizing and hence —> is not strongly normalizing. ◄ 


► Remark 39 (SAT encoding of the conditions of Theorem [38] ) . To encode the conditions of 
Theorem [38] as a Boolean satisfiability problem we proceed as follows. For every q, q' £ Q, 
we introduce a fresh variable 


v< t q iq ' with the intended meaning: u< jgi9 ' is true <*=> q < q' , 

and for every q £ Q and £ ^ r £ R a fresh variable 

V£ }q ^ r with the intended meaning: is true £ —> r £ f(q) . 


The conditions (a) |(b) and|(c)|of Theorem 38 can be encoded as follows: 
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(a) For condition (a) we proceed as follows. We ensure that < is a quasi-order using the 


for every q £ Q 
for every q l7 q 2 ,q 3 G Q 


(reflexivity) 

(transitivity) 


following formulas 
v <,q,q 

V <,qi,Q2 A v <,q 2 ,q3 —* V <,qi ,93 

For monotonicity of the automaton with respect to < we introduce fresh variables 

rn f,qi,-~,q#(.f),q,q' 

and formulas 

m f,qi,- -,q#(f),q,q' v <,q,q' ^ v f,qi,---,q#(f),q' 

for every / £ £ and qi,..., q#(f), q, q' G Q ■ Then monotonicity translates to formulas 

for every q, q' £ Q 1 and 

^ v <,qi,q'i V > q#u)> i’i' 

q'GQ 

for every q 1 ,..., q#( f ), q € Q, i G {1, ■ - ■, #(/)} and q[ G Q. 

(b) For condition |(b)| we adapt Remark |l7| For every t £ U, assignment a : Var(t) — ► Q and 
q £ Q, we have fresh variables 


v t . 


with the intended meaning: Vt, a ,q is true 


ta ~~s>* q . 


with the corresponding formulas (ensuring the intended meaning) as in Remark 17 For 
every £ —► r G R, a : Var(£) —> Q and q, q' £ Q 1 we introduce a fresh variable rn r ^ aiq ^' 
and formula 


m r 




Likewise, for every £ —► r £ R, r' < r, a : Var(r') —> Q and q £ Q, we add a fresh variable 


m F , 


and formula 


niF r 


v F q A v r 


Then condition (b) translates to 


V£,ot,q A t ^r z a,q,q'^) V ( F,r',a,q ) 

q'&Q r'<r, qeQ 

for every £ —>• r G R, a : Var (£) —► Q and q £ Q. 


(c) For condition (c) we extend Remark 25 as follows. Let £\,... ,£ n be the left-hand sides 
of rules in R. Let B be the automaton and ,..., Ff n the sets of states obtained from 
Lemma |20| We construct the product automaton Ax B, and then we compute those 
states that are reachable without passing states (q,q’) for which there exists £ —> r £ £(q) 
such that q' £ Fi (that is, the rule £ —> r is selected by A and B confirms that the term 


is an instance of £). We adapt the encoding of Remark 25 as follows: 
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(i) for every / € E with arity n and (ft, q[),. ■ ■, (q n , q'„), (q, q') € Q x Q': 

( v A ,f, qu ..., q „, q A v B j, q ' 1 ,..., q ' ri y A p (quq [) A P( q 2 y 2 ) A ... A P( qn , q >j 

) ->■ P( q , q ') 


(ii) for every (q,q') gQ xQ’: p („,„>) -t ~<v AtFiq . 

In (i), due to the added condition, we do not consider all reachable states (q,q') but only 
those for which there exists no rule £ —► r G R that is applicable at the root of the current 
subterm (q' G Fg) and that is activated {vg. q j^ r ). The formulas in (ii) guarantee that no 
state (q, q'), that is reachable in this sense, is accepted by A. 

► Remark 40 (Incompleteness with respect to Loops). We note there exist (left-linear) TRSs 
systems that admit loops but cannot be proven non-terminating using Theorem |38| For 
example, consider the following rewrite system: 

a->b f(a,b)->g(a) g(x) -> f(x, x) 

This system admits the loop g(a) —> f(a,a) —> f(a,b ) —> g(a). However, it is not possible to 
prove non-termination of this system by Theorem |38| We sketch the argument. Assume that 
there was a tree automaton A fulfilling the conditions of the theorem. Then the automaton 
admits the rewrite step g(a) f(a,a). Since both a’s in /(a, a) are copies of the a in g(a), 
the automaton ‘interprets’ both occurrences of a as the same state. Now, to obtain an infinite 
rewrite sequence, the rule a —» b must be activated for the right a in /(a, a). However, then 
this rule is also activated for the left a, but contracting the left a leads to a normal form. 


A 


A 




t^r£R, q’GF c 
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Experimental Results 


We have implemented the improved method for disproving strong normalization (Theorem |38| 
presented in this paper. For the purpose of evaluating our techniques, the tool applies only 
the methods presented in this paper, and no other non-termination method like loop checks. 
The SAT solver employed for the evaluation results in this section is MiniSat [B]. Our tool 
can be downloaded from http://joerg.endrullis.de/non-termination/. 

Our tool can automatically prove non-termination of all examples in this paper, including 
the S-rule and the d-rule. The following table shows the size of the automata that are found 
by the tool as witnesses for non-termination for the examples in our paper: 


Example 

29 


30 


31 


32 


33 


34 

Number of states 

4 

5 

3 

4 

5 

6 


Each of these automata has been found within less than a second on a dual core laptop. 

We have also evaluated our methods on the database used in [7] , consisting of 58 non¬ 
terminating term rewriting systems that do not admit loops. The tool AProVE recognizes 44 
systems as non-terminating; an impressive 76%. An extension of AProVE with our method 
would increase the recognition by 8.5% to 84.5%. In other words, our method succeeds on 
36% (that is 5 systems) of the remaining 14 systems for which AProVE did not find a proof. 
These 5 systems are: 


nonloop/TRS/emmes/ex3_4.trs 
nonloop/TRS/own/challenge_fab.trs 
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h nonloop/TRS/own/downfrom.trs 
m nonloop/TRS/own/ex_payet.trs 
h nonloop/TRS/own/isList-List.trs 

In total, our tool succeeds for 26 of the 58 non-looping examples from [7j. The results 
suggest that our method and that of [7] are complementary and should be combined for 
maximum strength. The paper [7j explicitly mentions that the following example is beyond 
their techniques (this example is not part of the database above): 

/(true, true, x, s (y)) —» f{isNat{x), isNat(y),s(x), double(s(y))) 
isNat( 0) —> true 
isNat(s(x)) —> isNat(x) 
double(0) —> 0 

double(s(x)) —> s(s (double(x))) 


Our non-termination techniques can handle this system: the tool finds an automaton with 6 
states within 3 seconds (using the transformation from Remark 26 ). 

Finally, we have evaluated the tool on the termination problem database (TPDB). We 
have run our tool on all string and term rewriting systems (of the standard categories) that 
remained unsolved during the last full run of all tools in December 2013. For string rewriting, 
our tool was able to disprove termination for 13, and for term rewriting, for 8 systems of 
the unsolved systems. This corresponds to an increase of strength of 11.5% (114 + 13) for 
string rewriting and of 3% (274 + 8) for term rewriting. Let us mention that many of the 13 
string rewriting systems actually admit loops, but very complicated ones, that are not found 
by the standard tools. These loops have been found in previous competitions by the tools 
Matchbox [33) and Knocked for Loops 133- 


7 Conclusions and Future Work 

In this paper, we have employed regular languages for proving non-termination. Instead of 
searching for an infinite reduction explicitly we search for a regular language with properties 
from which non-termination easily follows. After encoding these properties in a propositional 
formula, the actual search is done by a SAT solver. In some examples, like Example [32] 
a very simple corresponding regular language is quickly found by our approach, while the 
actual infinite reductions have a non-linear pattern being beyond earlier approaches. 

For future work, it is interesting to investigate whether this approach can be extended to 
context-free (tree) languages; such an approach could potentially also generalize [7j. The 
question is whether there are efficient criteria to check the conditions of Theorem [2j For 
example, consider the following string rewriting system: 

bB —> Bb bed —> BcD Dd —> dD 

aX -* abb BX -* Xb bed -* XcY YD -* dY Ye-* dde 

This system admits for every n > 1 reductions of the form 

ab n cd n e -** a B n ~ x bed D n ~ x e —>■* a B^ 1 XcY D 11 " 1 e-** ab n+1 cd n+1 e 

As a description of this pattern needs a context-free language, it is unlikely that a regular 
language exists that fulfills the requirements of Theorem [2j 
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As described in Remark 26 the SAT encoding of (non-deterministic) automata is not 
efficient for symbols of higher arity. We think that these problems can be overcome by more 
efficient encodings of automata. For example, the uncurrying transformation mentioned 
in Remark 26 can be seen as a restriction of the shape of the automata (the transition is 
computed argument by argument) instead of a transformation on the system. It would be 
interesting to investigate what other restrictions would lead to a more efficient representation 
of automata as Boolean satisfiability problems. Results in this direction can be of interest in 
various areas where automata are applied. 

We think that it is also interesting to investigate whether the characterization of strong 
and weak normalization (Theorems [2] and [3]) can be adapted to the setting of infinitary 
rewriting EB E23 E! with infinite terms and ordinal-length reductions; the interesting 
properties then are infinitary strong and weak normalization. 

Finally, we note that equality of streams P3 [15], 35, HI] (infinite sequences of symbols) 
can be rendered as a non-termination problem (a comparison program running indefinitely 
if the streams are equal, and terminating as soon as a difference is found). It remains to 
be investigated whether non-termination techniques can be employed fruitfully for proving 
stream equality. 
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